CD-REST: a system for extracting chemical-induced disease relation in literature

نویسندگان

  • Jun Xu
  • Yonghui Wu
  • Yaoyun Zhang
  • Jingqi Wang
  • Hee-Jin Lee
  • Hua Xu
چکیده

Mining chemical-induced disease relations embedded in the vast biomedical literature could facilitate a wide range of computational biomedical applications, such as pharmacovigilance. The BioCreative V organized a Chemical Disease Relation (CDR) Track regarding chemical-induced disease relation extraction from biomedical literature in 2015. We participated in all subtasks of this challenge. In this article, we present our participation system Chemical Disease Relation Extraction SysTem (CD-REST), an end-to-end system for extracting chemical-induced disease relations in biomedical literature. CD-REST consists of two main components: (1) a chemical and disease named entity recognition and normalization module, which employs the Conditional Random Fields algorithm for entity recognition and a Vector Space Model-based approach for normalization; and (2) a relation extraction module that classifies both sentence-level and document-level candidate drug-disease pairs by support vector machines. Our system achieved the best performance on the chemical-induced disease relation extraction subtask in the BioCreative V CDR Track, demonstrating the effectiveness of our proposed machine learning-based approaches for automatic extraction of chemical-induced disease relations in biomedical literature. The CD-REST system provides web services using HTTP POST request. The web services can be accessed fromhttp://clinicalnlptool.com/cdr The online CD-REST demonstration system is available athttp://clinicalnlptool.com/cdr/cdr.html. Database URL:http://clinicalnlptool.com/cdr;http://clinicalnlptool.com/cdr/cdr.html.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

بررسی همراهی بیماری سلیاک و کولیت اولسروز

Background and purpose: Celiac disease (CD) is a genetic disorder that causes inflammation in the lining of the small intestine. Recent literature has shown a relation between inflammatory bowel disease and celiac disease. This study aimed at determining the prevalence of celiac disease in patients with ulcerative colitis (UC). Material and Methods: In this cross-sectional study 84 patients (a...

متن کامل

A crowdsourcing workflow for extracting chemical-induced disease relations from free text

Relations between chemicals and diseases are one of the most queried biomedical interactions. Although expert manual curation is the standard method for extracting these relations from the literature, it is expensive and impractical to apply to large numbers of documents, and therefore alternative methods are required. We describe here a crowdsourcing workflow for extracting chemical-induced di...

متن کامل

A Hybrid System for Extracting Chemical-Disease Relationships from Scientific Literature

We propose a hybrid system for extracting chemical-disease relationships from Medline abstracts. At the core of our approach is a general, rule-based system that extracts causal relations from text, using a combination of trigger lists and syntactic dependencies. We augmented this system with supervised learning. We trained two binary classifiers: one extracts intra-sentential relationships bet...

متن کامل

BELMiner: adapting a rule-based relation extraction system to extract biological expression language statements from bio-medical literature evidence sentences

Extracting meaningful relationships with semantic significance from biomedical literature is often a challenging task. BioCreative V track4 challenge for the first time has organized a comprehensive shared task to test the robustness of the text-mining algorithms in extracting semantically meaningful assertions from the evidence statement in biomedical text. In this work, we tested the ability ...

متن کامل

Sulfasalazine plus Chloroquine-Induced Mood Disorder in a Patient with Rheumatoid Arthritis

Rheumatoid arthritis is a chronic systemic inflammatory disease that affects approximately 0.5-1% of the world population. The current approach to this disease is to start an intensive treatment without delay once the disease has developed. Various studies in the literature have shown that combination of disease modifying antirheumatic drugs such as sulfasalazine and chloroquine offers a more a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 2016  شماره 

صفحات  -

تاریخ انتشار 2016